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ABSTRACT 

We present measurements of the angular correlation function of galaxies selected from the first field of the H- ATLAS survey. Careful removal of 
the background from galactic cirrus is essential, and currently dominates the uncertainty in our measurements. For our 250 yum-selected sample 
we detect no significant clustering, consistent with the expectation that the 250 /jm-selected sources are mostly normal galaxies at z < 1. For our 
350 yum and 500 /jm-selected samples we detect relatively strong clustering with correlation amplitudes A of 0.2 and 1.2 at 1', but with relatively 
large uncertainties. For samples which preferentially select high redshift galaxies at z ~ 2 - 3 we detect significant strong clustering, leading to an 
estimate of ro ~ 7 - 1 1 /i"' Mpc. The slope of our clustering measurements is very steep, 6 ~ 2. The measurements are consistent with the idea that 
sub-mm sources consist of a low redshift population of normal galaxies and a high redshift population of highly clustered star-bursting galaxies. 
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1. Introduction 

Submillimetre (sub-mm) selected galaxy samples provide a 
unique way to trace obscured star formation out to high redshifts 
(Blain et al. l2002l l. Models for the evolution of such populations 
disagree on the nature of the sub-mm sources at high redshifts, 
with some claiming that they are massive galaxies in the pro- 
cess of forming most of their stellar mass (Granato et al. 2004, 
Narayanan et al. 2009, Dave et al. 2010) while others model 
them as lower mass sources undergoing bursts of star formation 
with a top heavy IMF (Baugh et al. 2005, Lacey et al. 2009). 
While evidence on individual sources largely supports a mas- 
sive halo scenario (Dunne, Eales & Edmunds 2003, Swinbank 
et al. 2008, Michalowski, Hjorth & Watson 2010), the best way 
to measure the statistical halo properties of this population is 
to measure their clustering. The three-dimensional clustering of 
sub-mm galaxies provides information about the dark-matter ha- 
los that they populate, but direct measurements need distance es- 
timates for each galaxy, which we do not have for our sample. 
The angular clusteiing can be measured for flux-limited samples 
but, in order to compare with models, predictions are required 
for both the n(z) of flux limited samples and the intrinsic 3-d 
clustering of the galaxies. The sub-mm colours depend on the 
source redshift, and so selecting on colour can preferentially se- 
lect high or low redshift samples (see e.g. Amblard et al. 2010). 
Previous attempts to measure the clustering of sub-mm 
sources have been mostly based on catalogues which cover 
only very small areas. Despite predictions that sub-mm galax- 
ies should have high spatial clustering, their n(z) is broad and so 
previous work has had limited success in detecting a significant 



* Herschel is an ESA space observatory with science instruments 
provided by European-led Principal Investigator consortia and with im- 
portant participation from NASA. 



angular clusteiing signal (Blain et al. 2004, Scott et al. 2006, 
WeiB et al. 2010). A more recent approach has been to measure 
the power spectrum of larger area sub-mm maps from BLAST 
(Viero et al. 120091 Devlin et al. 2009). This analysis has found 
significant evidence for clustering, although at relatively low am- 
plitude. 

The Herschel ATLAS (H-ATLAS) (Eales et al. WiOS will 
survey over 550 deg^ in 5 wavebands at 100, 160, 250, 350 and 
500 yum. One field covering ~ 4° x 4° degrees was observed 
during the science demonstration phase of the mission, and has 
produced a catalogue of ~ 6600 sources with significance > 5cr 
above the combined instrumental and confusion noise. This rep- 
resents roughly 1/30 of the final H-ATLAS data-set. In this paper 
we present measurements of the angular correlation function of 
five flux and colour-selected samples of the H-ATLAS sources. 



2. H-ATLAS data and source catalogues 

H-ATLAS uses parallel scan mode observations performed with 
the ESA Herschel Space Observatory (Pilbratt et al. l20I0l l. pro- 
viding data simultaneously from both the PACS (Poglitsch et al. 
2010) and SPIRE (Griffin et al. 2010) instruments. The time-line 
data are reduced using HIPE. Maps are produced from the SPIRE 
data using a naive mapping technique after removing instrumen- 
tal temperature variations from the time-line data (Pascale et al. 
in prep). Noise maps are generated by using the two cross-scan 
measurements to estimate the noise per detector pass, and then 
for each pixel the noise is scaled by Npassesi where Np„sses is 
the number of detector passes. A false-colour image combin- 
ing the SPIRE 250, 350 and 500 fim maps is shown in Fig. [T 
The PACS H-ATLAS maps currently yield only a few hundrec 
sources, which is insufficient for us to attempt a clustering mea- 
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Fig. 1. False-colour image combining the SPIRE 250, 350 and 
500 jiva maps as blue, green and red respectively. The ragged 
edges show the individual scan-legs of the two scans. Galactic 
cirrus can be seen as patchy blue wisps over the field. 



(Li) S3S0>36mJy (350+) 




Fig. 2. The positions of sources in four subsamples (a) 5 250 > 
33mJy; (b) 5350 > 36mJy and 3cr in 5250 and 5 500 (the 350-1- 
sample); (c) 5 goo > 45mJy; (d) the colour selected sample 

5 500 /-S 250 > 0.75. 



surement. The clustering in the PACS bands will be investigated 
in a future paper using more data. 

Sources were identified in the SPIRE maps using a Multi- 
band Algorithm for source extraction (MADX, Maddox et al. in 
prep). First a local background is estimated from the peak of the 
histogram of pixel values in 30 x 30 blocks of pixels. This corre- 
sponds to 2.5' for the 250 jum map, and 5' for the 350 and 500 yum 
maps. The background at each pixel is then estimated using a bi- 
cubic interpolation between the coarse grid of backgrounds, and 



this value subtracted from the pixel. The filter scale was chosen 
to be as large as possible while still following the variations in 
the cirrus background. Since the local background is estimated 
from the peak of the flux histogram, it is insensitive to the pres- 
ence of resolved sources within the background block, so long 
as they do not cover a significant number of pixels in the block. 
This approach should remove the local background without re- 
moving flux from the resolved sources, and so should be less 
susceptible to removing real structure in the source distribution 
compared to standard Fourier filtering approaches. 

The background subtracted maps are then filtered by the es- 
timated PSF, including a local inverse variance weighting. The 
maps from all three bands are then combined with weights set by 
the local inverse variance, and also the prior expectation of the 
SED of the galaxies. We tried a flat-spectrum prior, where equal 
weight is given to each band and also 250 yum weighting, where 
only the 250 //m band was included. At the depth of the filtered 
maps source confusion becomes an issue in the longer wave- 
length bands, and the higher resolution of the 250 ;um maps out- 
weighs the signal-to-noise gain from adding in the other bands. 
The cuiTent catalogues use the 250 /im-only prior and we will 
revisit this issue in future data releases. 

All local peaks are identified in the combined PSF filtered 
map as potential sources, and a Gaussian is fitted to each peak to 
give estimates of the position at the sub-pixel level and the point 
source flux. The flux densities in other bands are estimated by 
using a bi-cubic interpolation to the position given by the com- 
bined map. To produce a catalogue of reliable sources, we select 
only sources that are detected at the 5-cr level in any of the bands. 
In calculating the cr for each source, we use the relevant noise 
map, and add the confusion noise to this in quadrature. The av- 
erage 1-cr instrumental noise values in the PSF-filtered maps are 
4, 4 and 5.7mJy beam"' respectively in the 250, 350 and 500 jum 
bands. We estimated the confusion noise from the difference be- 
tween the variance of the maps and the expected variance due to 
instrumental noise, and find that the 1-cr confusion noise is 5, 6 
and 7 mJy beam"' at 250, 350 and 500 /^m. The resulting total 
5-cr limits are 33, 36 and 45mJy beam"' (Rigby et al. in prep). 
Source counts from these catalogues are analysed by Clements 
et al. (120101 ). and are found to be consistent with previous mea- 
surements in these wave-bands (Patanchon et al. 12009b . 

We have selected five samples to use for our current clus- 
tering analysis. The first three use simple flux density cuts, as 
given in Table [T] The fourth and fifth samples are as defined 
by Amblard et al. (2010), who use the H- ATLAS colours to es- 
timate redshift distributions. The fourth sample, which we call 
350h- is > 5cr at 350;um with an extra constraint that the sources 
must also be detected at more than 3cr in the 250 //m and 500 jum 
bands, and the fifth sample adds a further constraint that the ra- 
tio 5 500/5 250 > 0.75. Requiring a detection at 500 yum tends to 
select higher redshift galaxies compared to a simple 350 fiva se- 
lection, and Amblard et al. estimate that the mean redshift of the 
350h- sample is 2.2 + 0.6. The 5500/^250 > 0.75 colour selection 
pushes to an even higher redshift, of 2.6 + 0.3. The positions of 
sources in four of the sub-samples are shown in Fig.|2] 



3. Measuring w{B) 

We measure the correlation function by counting pairs in the data 
as a function of angular separation and comparing to the num- 
ber of pairs in a random catalogue with similar boundaries and 
selection effects. The pair counts are combined to estimate the 
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Table 1. Subsamples used to measure w(0), and best- 
fit power-law parameters. A^ is the number of sources 
in each sample. A is the amplitude at 1' and 6 is 
the power-law slope. Aq.s and A2.0 are the amplitudes 
at r with the slopes fixed at 0.8 and 2.0 respectively. 



Sample 



N 



10.8 



12.0 



'^250 > 33 
5350 > 36 
S 350 > 36* 
5500 > 45 



6317 -0.01+0.07 1.7 ±0.2 -0.00 -0.01 

2754 0.20 + 0.07 2.0 + 0.2 0.11 0.20 

1633 0.50 + 0.09 2.8 + 0.5 0.21 0.50 

304 1.24 ±1.6 2.4+1.3 0.51 1.24 



550o/5250>0.75 808 0.92 + 0.3 2.1+0.5 0.38 0.92 

* this is the 350+ sample which has the additional constraint that source 
must be detected at > 3cr in the other two bands. 



correlation function, w{6), using the Landy & Szalay estimator 
(fT993l l 



w{ff) 



DD - 2DR + RR 
RR ■ 



(1) 



Here DD is the number of data-data pairs, DR is the number 
of data-random pairs and RR is the number of random-random 
pairs, each at separation 6. 

The random catalogues were generated to follow the sensi- 
tivity limit of the actual data selection. This means that any non- 
uniformities in the data due to variation in signal-to-noise should 
not be imprinted on the clustering signal. To relate the noise at 
a pixel to the expected number density of sources we generated 
random fluxes which match the observed count slope (Clements 
et al. 120 101) . perturbed them by a Gaussian deviate with standard 
deviation equal to the local noise estimate, and then kept the 
random source if it was brighter than the chosen flux limit. In 
practice the noise maps are uniform enough that using uniform 
random catalogues makes no significant difference to the results. 

The clustering measurements are sensitive to the correct re- 
moval of the spatially varying cirrus background, as well as the 
unresolved background of faint sources, which are also likely to 
be strongly clustered. We have investigated the stability of the 
measurements by masking areas around the brightest patches 
of cirrus. This had little effect on the measurements indicating 
that our cirrus removal is effective. Increasing the scale of back- 
ground filtering to 60 and 120 pixels produces a much larger 
clustering signal. A visual inspection of the source positions 
makes it clear that the excess structure in the source distribu- 
tion is coiTelated with the pattern of ciiTus emission, and so is 
likely to be spurious signal caused by insuflicient background 
subtraction. 

A potential concern when using such a small scale to remove 
the background is that some real clustering may have been re- 
moved. This was tested by using clustered source positions to 
create simulated maps, which include cirrus background esti- 
mated from the IRAS maps (Schlegel Finkbeiner & Davis [T998t 
and the same noise and coverage maps as the real data. The back- 
ground was then filtered and sources extracted using the MADX 
algorithm as for the real data, and the clustering of the resulting 
sources measured. The clustering amplitude recovered from the 
simulations varied with background subtraction scale in a sim- 
ilar way to the real data. The coiTect amplitude was recovered 
using 30 pixels; using 15 pixels underestimated the amplitude 
by ~ 10%. We therefore believe that our background subtraction 
has removed the effect of cirrus on the source clustering, yet has 
not removed true structure in the source distribution. 



Our measurements of w{9) are shown in Fig. |3] The panels 
(a) and (c) show the flux limited samples in the 250 and 500 fim, 
bands while panel (b) shows the 350+ sample and panel (d) the 
5' 500/^5 250 > 0.75 colour selected sample. The error bars on the 
plots are estimated from the Poisson noise in the pair counts. We 
fitted the data using a simple power law of the form w{ff) - A6^^, 
coiTected for the integral constraint using the Roche and Bales 
(1999) technique. The power law slopes, 6 and amplitudes at 1 
arc minute, A are given in TablJT] Uncertainties on these mea- 
surements were estimated by fitting power laws to Monte-Carlo 
realizations of the data, and measuring the standard deviation of 
the resulting parameters. 



4. Discussion 

The 5250 sample has no detectable clustering signal. The sim- 
ple 5350 flux limited sample does show fairly significant clus- 
tering at scales < 2', but the 350+ sample produces a higher 
amplitude and more significant detection, as expected given that 
adding cuts in the other two bands leads to a narrower n(z) by re- 
moving low-z galaxies. The 5 500 sample gives a noisier w{6), but 
also shows a high amplitude. The colour-selected sample with 
5500/^250 > 0.75 shows a higher amplitude, and higher noise 
compared to the 350+ sample. The power-law fits to all samples 
give steep slopes 6 ~ 2. 

The increasing amplitude in samples selected at longer 
wavelengths suggests that clustering is stronger in the higher 
redshift populations, since selection at longer wavelengths tends 
to favour higher redshift galaxies. According to some models, 
these are highly clustered star-burst galaxies which are the an- 
cestors of present day ellipticals (Negrello et al. l2007l Dave et al. 
,20101). The two samples with additional color cuts have n(z) from 
photometric redshifts derived by Amblard et al. 2010. which can 
be used to convex! the angular amplitudes to spatial ro. For both 
samples we find a range of ro ~ 7 - 1 1/z ' Mpc, depending on the 
slope used (S = 2 ot 6 - 0.8 respectively). While the slope we 
measure at scales of a few arcmin is universally steep we cannot 
be sure with the cuiTent data-set what the behaviour will be at 
larger scales. A larger data set is required to fully address the 
behaviour of the slope and hence reduce the uncertaintites in ro. 

At first sight, the non-detection of clustering in the 250 jum 
sample is somewhat surprising. It contains a high fraction of 
lower redshift galaxies, with > 30 percent at z < 1 (Smith et 
al. in prep). This low-z population is expected to cluster in a 
similar way to local optical galaxies. Galaxies in the SDSS with 
magnitudes 21 < r* < 22 have a coiTelation amplitude of 0.046 
at r (Connolly et al. 2002). Also most of the 350 jum sample 
is a subset of the 250 yum sample, so their clustering will con- 
tribute to the 250 yum clustering. Assuming they are uncorrected 
with the low-z galaxies, they will contribute an amplitude scaled 
down by the relative density squared, leading to an expected am- 
plitude ~ 0.03. These relatively small amplitudes are consistent 
with our measurement of -0.01 + 0.07. 

Overall, these results are consistent with the general expec- 
tation from models which include highly clustered high red- 
shift and weakly clustered low redshift populations of sub-mm 
emitting galaxies. Models which postulate that sub-mm sources 
have a higher mass IMF than normal (Baugh et al. 2005, Lacey 
et al. 2010) predict a higher clustering strength at lower red- 
shifts, which seems at first glance to be at odds with our result. 
However, these models do predict far stronger clustering above 
a threshold in luminosity which is redshift dependent. It is possi- 
ble that our high-z samples exceed this threshold while the low-z 
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Fig. 3. Plot of ^(6*) for four subsamples discussed in the text. The panels show (a) 5 250 > 33mJy, (b) 5 350 > 36mJy with 3(t 
detections in the 250 jiva and 500 /im bands (the 350+ sample), (c) 5 500 > 45mJy, and (d) 5350 > 36mJy with 3cr and 5'5oo/5'25() > 
0.75 colour selected sample. The error bars on the plots are estimated from the Poisson noise in the pair counts. Note the expanded 
scale on panel (a). 



samples do not. It is beyond the scope of the current data set to 
be able to confirm or rule out these models. 

There are few previous observations to compare to directly, 
and those that are available have rather different selections and 
redshift distributions. Magliocchetti et al. ( I2008I I analysed the 
distribution of bright 24 yum sources with faint optical counter- 
parts, split into high {{z) ~ 2) and low redshift ((z) ~ 0.8) sub- 
samples. They found a low amplitude (A ~ 0.14 ± 0.05) at low 
redshift and a higher amplitude (A ~ 0.26 + 0.1) at high redshift. 
This trend of w{6) increasing towards higher z is similar to our 
observations. 

The power spectrum analysis of the BLAST data by Viero 
et al. ( 120091 ) detects clustering on scales 5' < < 20'. Once 
interpreted within the Halo Model formalism, their measure- 
ment points to an increase in the spatial clustering of the back- 
ground sub-mm source population with increasing wavelength 
and therefore increasing redshift. Again this is consistent with 
our findings. 

LABOCA observations of the extended Chandra Deep Field 
South have produced a catalogue of 126 sources selected at 
850 jum (WeiB et al 120091) . A weak detection of clustering is 
found on scales 6 < 2'. Fixing the slope to be 0.8, their power- 
law fit has an amplitude of 0.18 + 0.1 at 1'. This is similar to the 
amplitude that we find for the 350 //m selected sample. 

Though all three of these measurements are similar to ours, it 
is not simple to make a direct comparison because either or both 
of the flux-limit or pass-bands are different, and so the redshift 
distributions are not the same. 

Given the statistical limitations on the current small field, 
we leave detailed comparisons to models to a later analysis with 



more data. However we can say that our measurements appear to 
be consistent with the prevailing models, where sub-mm sources 
consist of a low redshift population of normal star-forming disk 
galaxies that have a spatial correlation length, ro ~ 4 Mpc, and a 
high redshift population of highly clustered star-bursting galax- 
ies ro ~ l OMpc (Negrello et al. 2007, Narayanan et al. 20091 
Dave et al. 120101 ). The rather steep slope of our measurements 
for the higher redshift samples is also consistent with the high 
redshift population forming in compact protoclusters. 

The current field is only 1/30 of the area that H- ATLAS will 
cover, and it has the brightest cirrus background of the planned 
fields. The final H-ATLAS dataset will have much larger fields 
with lower cirrus emission, and so will provide a benchmark 
measurement of source clustering in sub-mm populations. 
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